The Preliminary Results of a Mandarin Dictation Machine Based Upon Chinese Natural Language Analysis
Abstract
This paper describes the preliminary results of the world's first research effort toward a Mandarin dictation machine for entering Chinese characters into computers. Considering the special characteristics of the Chinese language, syllables are chosen as the basic units for dictation. The machine is divided into two subsystems. The first recognizes the syllables using speech signal processing techniques. Because every syllable can represent many different characters with completely different meanings, the second subsystem identifies the exact characters from the syllables and corrects errors in syllable recognition by first forming all possible words from the syllables and then finding a combination of those words that is grammatically valid in a sentence. The preliminary test results indicate that such a dictation machine is not only practically attractive but also technically achievable.
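The word-forming step of the second subsystem can be illustrated with a minimal sketch. It is not the authors' implementation: the toy lexicon, the toneless pinyin spelling of syllables, the two-syllable word limit, and the function names `segmentations` and `fewest_words` are all assumptions made for illustration. The sketch enumerates every way to cover a recognized syllable sequence with lexicon words, then picks one candidate using a fewest-words heuristic as a stand-in for the grammatical analysis the abstract describes.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical toy lexicon: a tuple of syllables maps to the words it may spell.
# A single syllable is ambiguous because it can stand for several characters.
LEXICON: Dict[Tuple[str, ...], List[str]] = {
    ("wo",): ["我"],
    ("men",): ["們", "門"],
    ("wo", "men"): ["我們"],
    ("shi",): ["是", "事"],
    ("xue",): ["學"],
    ("sheng",): ["生", "聲"],
    ("xue", "sheng"): ["學生"],
}

MAX_WORD_LEN = 2  # longest word, in syllables, in this toy lexicon


def segmentations(syllables: Sequence[str]) -> List[List[str]]:
    """Return every way to cover the syllable sequence with lexicon words."""
    if not syllables:
        return [[]]
    results: List[List[str]] = []
    for n in range(1, min(MAX_WORD_LEN, len(syllables)) + 1):
        head = tuple(syllables[:n])
        for word in LEXICON.get(head, []):
            for rest in segmentations(syllables[n:]):
                results.append([word] + rest)
    return results


def fewest_words(candidates: List[List[str]]) -> List[str]:
    """Placeholder for grammatical analysis: prefer the fewest, longest words."""
    return min(candidates, key=len)


if __name__ == "__main__":
    candidates = segmentations(["wo", "men", "shi", "xue", "sheng"])
    print(len(candidates), "candidate word sequences")
    print("selected:", "".join(fewest_words(candidates)))
```

Even this five-syllable input yields many candidate word sequences, which is why a selection step is needed. A real system would replace `fewest_words` with a check that the chosen words form a grammatically valid sentence, and would also consider alternative syllable hypotheses to correct recognition errors.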
Similar resources
Golden Mandarin (I)-A real-time Mandarin speech dictation machine for Chinese language with very large vocabulary
Abstract: This paper describes the world's first successfully implemented real-time Mandarin dictation machine, which recognizes Mandarin speech with very large vocabulary and almost unlimited texts for the input of Chinese characters into computers. Considering the special characteristics of the Chinese language, syllables are chosen as the basic units for dictation. The machine is ...
Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data
This correspondence presents the first known results of complete recognition of continuous Mandarin speech for the Chinese language with very large vocabulary but very limited training data. Various acoustic and linguistic processing techniques were developed, and a prototype system of a continuous speech Mandarin dictation machine has been successfully implemented. The best recognition accurac...
Tangerine: a large vocabulary Mandarin dictation system
The text input for non-alphabetic languages, such as Chinese, has been a decades-long problem. Chinese Dictation using large vocabulary speech recognition provides a convenient mode of text entry. In contrast to a character based Dictation system [5], a word-based Mandarin dictation system has been designed [3] (based on Apple's PlainTalk speech recognition technology [4]) for efficient entry o...
Empirical study of Mandarin Chinese discourse analysis: an event-based approach
Discourse analysis plays an important role in natural language understanding. Mandarin Chinese discourse, which has many different properties compared with English discourse, is still far behind in the construction of a basic computational model. In this paper, we propose an event model to elucidate anaphora and ellipsis in Mandarin Chinese. An event-based approach (EBA) based on the model is d...
Will Input Style Affect Mandarin Short Messages in Mobile Device?: a Wizard of Oz Study
Speech input is a natural text entry method for handheld devices that are used in different contexts. We conducted an experiment to understand effects of input (speaking) style (phrasal vs. sentence input) on Chinese text entry rates and user satisfaction with other two variables: recognition rate (50%, 70% and 90%) and message length (10 vs. 20 characters). Wizard of Oz was applied in the expe...
Publication date: 1987